Software framework for hyperparameter optimization of models with additive regularization
Abstract
Processing unstructured data, such as natural language text, is a pressing task in the development of intelligent products. Topic modeling, as a method for working with unlabeled and partially labeled text data, is a natural choice for analyzing document corpora and building vector representations. It is therefore especially important to train high-quality topic models quickly, which the proposed framework makes possible. The framework implements an evolutionary approach to optimizing the hyperparameters of models with additive regularization and achieves high results on quality metrics (coherence, NPMI). To reduce computation time, it offers a mode that uses surrogate models, which speeds up calculations by up to 1.8 times without loss of quality. The effectiveness of the framework is demonstrated on three datasets with different statistical characteristics. The results exceed those of comparable solutions by an average of 20 % in coherence and 5 % in classification quality on two of the three datasets. A distributed version of the framework has been developed for conducting experimental studies of topic models. Thanks to its default data processing pipeline, the framework can be used without specialist knowledge of topic modeling. Researchers can use the results of this work to analyze topic models and extend the framework's functionality.
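The general idea of evolutionary hyperparameter search with a surrogate pre-screening step can be illustrated with a minimal sketch. This is not the framework's actual implementation: the abstract does not specify the algorithm's operators or the surrogate models used, so every name below (`toy_quality`, `mutate`, `surrogate_predict`, `evolve`) is hypothetical. The toy objective stands in for the expensive step of training a topic model and measuring its coherence, and a 1-nearest-neighbour lookup stands in for a trained surrogate model.

```python
import random


def toy_quality(params):
    # Hypothetical stand-in for "train a topic model, measure coherence".
    # The score is highest (zero) at the arbitrary point (0.3, 0.7).
    x, y = params
    return -((x - 0.3) ** 2 + (y - 0.7) ** 2)


def mutate(params, rng, sigma=0.1):
    # Gaussian perturbation of each hyperparameter, clipped to [0, 1].
    return tuple(min(1.0, max(0.0, p + rng.gauss(0.0, sigma))) for p in params)


def surrogate_predict(params, archive):
    # Assumed surrogate: reuse the score of the closest already-evaluated point.
    nearest = min(
        archive,
        key=lambda item: sum((a - b) ** 2 for a, b in zip(item[0], params)),
    )
    return nearest[1]


def evolve(objective, generations=30, pop_size=8, n_children=16,
           seed=0, use_surrogate=True):
    rng = random.Random(seed)
    population = [(rng.random(), rng.random()) for _ in range(pop_size)]
    archive = [(p, objective(p)) for p in population]  # every true evaluation
    scored = list(archive)                             # current population
    for _ in range(generations):
        children = [mutate(rng.choice(scored)[0], rng) for _ in range(n_children)]
        if use_surrogate:
            # Pre-screen: truly evaluate only the half the surrogate ranks best.
            children.sort(key=lambda c: surrogate_predict(c, archive), reverse=True)
            children = children[: n_children // 2]
        evaluated = [(c, objective(c)) for c in children]
        archive.extend(evaluated)
        # Elitist selection: keep the best pop_size individuals seen so far.
        scored = sorted(scored + evaluated, key=lambda item: item[1],
                        reverse=True)[:pop_size]
    # Return the best (params, score) pair and the number of true evaluations.
    return scored[0], len(archive)


if __name__ == "__main__":
    for flag in (True, False):
        best, n_evals = evolve(toy_quality, use_surrogate=flag)
        print(f"surrogate={flag}: best={best}, true evaluations={n_evals}")
```

In this sketch the surrogate mode halves the number of true objective evaluations per generation. Whether quality is preserved in practice depends on how well the surrogate ranks candidates, which a toy objective cannot show.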